Automating, Operationalizing and Productizing Journalistic Article Analysis

نویسندگان

  • Eric Kingery
  • Michael S. Manley
  • Daniel Ratner
چکیده

Public Good Software’s products match journalistic articles and other narrative content to relevant charitable causes and nonprofit organizations so that readers can take action on the issues raised by the articles’ publishers. Previously an expensive and laborintensive process, application of machine learning and other automated textual analyses now allow us to scale this matching process to the volume of content produced daily by multiple large national media outlets. This paper describes the development of a layered system of tactics working across a general news model that minimizes the need for human curation while maintaining the particular focus of concern for each individual publication. We present a number of general strategies for categorizing heterogenous texts, and suggest editorial and operational tactics for publishers to make their publications and individual content items more efficiently analyzed by automated systems. 1.Introduction: The Basic Problems In 2013, Public Good Software (PGS) introduced a product designed to direct readers concerned with the causes they read about in journalism and other media with the nonprofit organizations working on those causes. For example, an article about a shooting in Chicago would link to local anti-violence organizations so a reader could donate, volunteer, or just spread the word. This product, called “Public Good for Media” (PGM) initially debuted as a Javascript-powered “Take Action Button” that publications could insert into the HTML source of any page (see Figure 1). This first iteration of the product required explicit configuration of each insertion by a publication’s editorial staff with the assistance of PGS experts in curating the PGS marketplace’s collection of organizations and causes. Explicit configuration quickly proved untenable for both the publications and PGS in terms of staff time and expense, publishing workflow modifications and lack of growth in traffic driven through the PGM program. Figure 1: The Take Action Button embedded in an article To address this, PGS created tools to automate the process so that the PGM product could be inserted into a site-wide template and relieve the media outlets’ personnel of the burden of remembering how and when to include the PGM program as part of the publication process. Publishers also requested that PGS provide methods of automatically assigning the button’s destination based on the content of each individual story, as well as the suppression of the button’s display when there was no sensible destination, or when a story was not in any way cause-related (entertainment and sports, for example). Finally, while directing readers to nonprofits with a national area of effect was often sufficient, many publications either had more specifically located audiences, or wanted to highlight organizations working in the particular geographic area mentioned in the article. Bloomberg Data for Good Exchange Conference. 24-Sep-2017, New York City, NY, USA.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Analysis of Lexical Bundles in Journalistic Writing in English and Persian: A Contrastive Linguistic Perspective

  This paper investigates the use of ‘lexical bundles’ in two broad corpora of journalistic writing. The aim of this study is to compare the use of lexical bundles in the two domains, one consisted of newspaper articles written in English and published in England and the other one comprised of newspaper articles written in Persian from Iranian publications. For this purpose, the frequency...

متن کامل

From Academic to Journalistic Texts: A Qualitative Analysis of the Evaluative Language of Science

This study examined academic articles and journalistic reports in 5 disciplinary areas to explore how similar contents might attitudinally be realized in two different genres. To this end, 25 research articles and 210 news reports were carefully selected and underwent detailed discourse semantic and grammatical analyses with the purpose of identifying the evaluative linguistic patterns....

متن کامل

A Comparative Analysis of Lexical Bundles in Journalistic Writing in English and Persian: A Contrastive Linguistic Perspective

  This paper investigates the use of ‘lexical bundles’ in two broad corpora of journalistic writing. The aim of this study is to compare the use of lexical bundles in the two domains, one consisted of newspaper articles written in English and published in England and the other one comprised of newspaper articles written in Persian from Iranian publications. For this purpose, the frequency...

متن کامل

Prediction of Average and Perceived Polarity in Online Journalism

We predicted the average and perceived journalistic objectivity in online news articles based on the text of the article and data collected on the individual user reading the article, in order to better understand both how journalistic bias relates to the text of an article alone, and how reader demographics and political leanings influence the perceived objectivity of a given article. Using su...

متن کامل

Competition of Discourses in Journalistic Translation: Diplomatic Negotiations in Focus

We sought to understand  whether,  how, and  why the translated journalistic texts  related  to  the  Iranian  nuclear  negotiations  were manipulated. To this end,  we  monitored  a news agency’s Webpage in a time span of 46 days that began 3 days before Almaty I nuclear talks and ended  3  days  after  Almaty  II  talks.  Monitoring  resulted  in  a  corpus  made  up  of  36  target  texts  p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1710.08522  شماره 

صفحات  -

تاریخ انتشار 2017